Link-based Community Detection with the Commute-Time Kernel
نویسندگان
چکیده
The main purpose of this work is to find communities in a weighted, undirected, graph by using kernel-based clustering methods, directly partitioning the graph according to a well-defined similarity measure between the nodes (a kernel on a graph). The algorithm is based on a two-step procedure. First, the sigmoid commute-time kernel (KCT), providing a meaningful similarity measure between any couple of nodes, is computed from the adjacency matrix of the graph. Then, the nodes of the graph are clustered by performing a kernel clustering on this CT kernel matrix. For this purpose, simple, prototype-based, kernel versions of the k-means, the fuzzy k-means, the entropy-based fuzzy k-means, the gaussian mixtures model, as well as Ward’s hierarchical clustering, are introduced. The joint use of the CT kernel matrix and kernel clustering appears to be quite effective. Indeed, this methodology provides good results, outperforming the spherical k-means and spectral clustering, on a document clustering problem involving the newsgroups database, where the set of documents is viewed as a graph. Finally, the links between the proposed hierarchical kernel clustering and spectral clustering are examined.
منابع مشابه
An experimental investigation of kernels on graphs for collaborative recommendation and semisupervised classification
This paper presents a survey as well as an empirical comparison and evaluation of seven kernels on graphs and two related similarity matrices, that we globally refer to as "kernels on graphs" for simplicity. They are the exponential diffusion kernel, the Laplacian exponential diffusion kernel, the von Neumann diffusion kernel, the regularized Laplacian kernel, the commute-time (or resistance-di...
متن کاملGraph nodes clustering with the sigmoid commute-time kernel: A comparative study
This work addresses the problem of detecting clusters in a weighted, undirected, graph by using kernel-based clustering methods, directly partitioning the graph according to a welldefined similarity measure between the nodes (a kernel on a graph). The proposed algorithms are based on a two-step procedure. First, a kernel or similarity matrix, providing a meaningful similarity measure between an...
متن کاملCommunity Detection using a New Node Scoring and Synchronous Label Updating of Boundary Nodes in Social Networks
Community structure is vital to discover the important structures and potential property of complex networks. In recent years, the increasing quality of local community detection approaches has become a hot spot in the study of complex network due to the advantages of linear time complexity and applicable for large-scale networks. However, there are many shortcomings in these methods such as in...
متن کاملRobust Multi-body Motion Tracking Using Commute Time Clustering
The presence of noise renders the classical factorization method almost impractical for real-world multi-body motion tracking problems. The main problem stems from the effect of noise on the shape interaction matrix, which looses its block-diagonal structure and as a result the assignment of elements to objects becomes difficult. The aim in this paper is to overcome this problem using graph-spe...
متن کاملAn Experimental Investigation of Graph Kernels on Collaborative Recommendation and Semisupervised Classification
This paper presents a survey as well as a systematic empirical comparison of seven graph kernels and two related similarity matrices (simply referred to as graph kernels), namely the exponential diffusion kernel, the Laplacian exponential diffusion kernel, the von Neumann diffusion kernel, the regularized Laplacian kernel, the commute-time kernel, the random-walk-with-restart similarity matrix,...
متن کامل